Picture for Ruihao Gong

Ruihao Gong

Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing

Add code
Feb 02, 2026
Viaarxiv icon

Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge

Add code
Jan 26, 2026
Viaarxiv icon

MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping

Add code
Nov 19, 2025
Figure 1 for MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Figure 2 for MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Figure 3 for MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Figure 4 for MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Viaarxiv icon

Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals

Add code
Oct 31, 2025
Figure 1 for Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
Figure 2 for Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
Figure 3 for Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
Figure 4 for Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
Viaarxiv icon

LLMC+: Benchmarking Vision-Language Model Compression with a Plug-and-play Toolkit

Add code
Aug 13, 2025
Viaarxiv icon

Post-Training Quantization for Video Matting

Add code
Jun 12, 2025
Viaarxiv icon

Pre$^3$: Enabling Deterministic Pushdown Automata for Faster Structured LLM Generation

Add code
Jun 04, 2025
Viaarxiv icon

QVGen: Pushing the Limit of Quantized Video Generative Models

Add code
May 16, 2025
Viaarxiv icon

Hierarchical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM

Add code
Mar 10, 2025
Viaarxiv icon

PTSBench: A Comprehensive Post-Training Sparsity Benchmark Towards Algorithms and Models

Add code
Dec 10, 2024
Viaarxiv icon